Influential Gene Selection From High-Dimensional Genomic Data Using a Bio-Inspired Algorithm Wrapped Broad Learning System

نویسندگان

چکیده

The classification of high dimensional gene expression/ microarray data always plays an important role in various disease diagnoses and drug discovery. To avoid the curse dimensionality, selection most influential genes remains a challenging task for researchers machine learning field. As extraction features by bio-inspired algorithm is non-deterministic polynomial-time (NP-Hard) task, possibility applying new there. In this suggested work, recently developed algorithm, Monarch Butterfly Optimization (MBO), wrapped with Broad Learning System (BLS), called MBO-BLS, to choose classify simultaneously. first stage, pre-selection method (Relief) used select feature subset. Then, selected subset undergoes further execution MBO-BLS model. estimate efficacy presented model, six cancerous datasets are taken. Here, sensitivity, specificity, precision, F-score, Kappa, MCC measures impartial comparison. Further, prove supremacy method, basic BLS, Genetic Algorithm BLS (GA-BLS), Particle Swarm (PSO-BLS), existing ten models taken Moreover, examine designed model statistically, Analysis variance (ANOVA) test also performed here. From above qualitative quantitative analysis, it concluded that proposed outclasses other considering models.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical Learning Methods for High Dimensional Genomic Data Statistical Learning Methods for High Dimensional Genomic Data Title: Statistical Learning Methods for High Dimensional Genomic Data

Due to their high-dimensionality, -omics technologies require the development of computational methods that are able to work with large number of variables. Each data type is characterized by its method of measurement and by the biological aspect under study. Understanding the data properties allows the design of sophisticated and effective computational models that are able to uncover and expl...

متن کامل

Bio-inspired Broad-class Phonetic Labelling

Recent studies have shown that the correct labeling of phonetic classes may help current Automatic Speech Recognition (ASR) when combined with classical parsing automata based on Hidden Markov Models (HMM). Through the present paper a method for Phonetic Class Labeling (PCL) based on bio-inspired speech processing is described. The methodology is based in the automatic detection of formants and...

متن کامل

Feature Selection for Small Sample Sets with High Dimensional Data Using Heuristic Hybrid Approach

Feature selection can significantly be decisive when analyzing high dimensional data, especially with a small number of samples. Feature extraction methods do not have decent performance in these conditions. With small sample sets and high dimensional data, exploring a large search space and learning from insufficient samples becomes extremely hard. As a result, neural networks and clustering a...

متن کامل

De-Identification of Health Data in Big Data using a Novel Bio-Inspired Apoptosis Algorithm

part of this journal may be reproduced or used in any form or by any means without written permission from the publisher, except for noncommercial, educational use including classroom teaching purposes. Product or company names used in this journal are for identification purposes only. Inclusion of the names of the products or companies does not indicate a claim of ownership by IGI Global of th...

متن کامل

Genomic Selection GENOMIC SELECTION USING A FAST EM ALGORITHM 2. ANALYSIS OF SIMULATED DATA

The paper reports on a fast EM algorithm for genomic selection by mapping QTL in genomewide dense SNP marker data. The algorithm called emBayesB was used to analyse a 6000 SNP dataset simulated for the QTLMAS XII workshop. True breeding value was accurately predicted by GEBV with a correlation of 0.85 in the validation data, while the regression coefficient of 0.97 indicated unbiased prediction...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2022

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2022.3170038